Sound Links

Field of the Invention
The present invention relates to the encoding of hyperlinks in sound signals.

Background of the Invention 

In recent years there has been an explosion in the number of services available over the
World Wide Web on the public internet (generally referred to as the "web"), the web being
composed of a myriad of pages linked together by hyperlinks and delivered by servers on
request using the HTTP protocol. Each page comprises content marked up with tags to
enable the receiving application (typically a GUI browser) to render the page content in the
manner intended by the page author; the markup language used for standard web pages is
HTML (HyperText Markup Language).


However, today far more people have access to a telephone than have access to a computer
with an Internet connection. Sales of cellphones are outstripping PC sales so that many
people have already, or soon will have, a phone within reach wherever they are. As a result,
there is increasing interest in being able to access web-based services from phones. 'Voice
Browsers' offer the promise of allowing everyone to access web-based services from any
phone, making it practical to access the Web any time and anywhere, whether at home, on
the move, or at work.

Indeed, because many items around the home and office have a sound capability, it is
attractive to use sound, not only for passing information to / from / between humans, but
also for passing functional information, such as URLs, to and between items of equipment.
JP 11-119974 (Sony) describes various ways of using sound URLs, these being DTMF
sound sequences that decode to character URLs.

A disadvantage of audible sound URLs is that they are generally highly unattractive to
humans as they possess a fairly random structure of sound (or so it appears to the human
ear). Whilst it is possible to hide sound data such as URLs in other, pleasanter sounds
using sound watermarking techniques, this generally requires complex embedding and
retrieval systems, which are expensive.

It is an object of the present invention to provide improved sound URLs and methods for
their usage.



Summary of the Invention 

According to one aspect of the present invention, there is provided a method of encoding a
URL in sound, wherein the characters of the URL are mapped to sound features in a sound
output, the nature of the sound features and of the mapping between characters and sound
features being such that at least certain character combinations that occur frequently in
URLs produce sound sequences of a musical character.

According to another aspect of the present invention, there is provided a method of
decoding a sound sequence into a URL, wherein sound features of the sound sequence are
mapped to characters of the URL, the nature of the sound features and of the mapping
between sound features and characters being such that sound sequences of a musical
character represent at least certain character combinations that occur frequently in URLs.


The present invention also encompasses apparatus for implementing the foregoing 
encoding and decoding methods. 


Brief Description of the Drawings 

A method and apparatus embodying the invention, for encoding and decoding sound
URLs, will now be described, by way of non-limiting example, with reference to the
accompanying diagrammatic drawings, in which:
. Figure 1 is a block diagram showing the main functional blocks of a tone URL
  translator;
. Figure 2 is a diagram illustrating the mapping between tones and characters for a
  first tone-URL encoding/decoding scheme;
. Figure 3 is a diagram illustrating the mapping between tones and characters for a
  second tone-URL encoding/decoding scheme;
. Figure 4 is a diagram illustrating a preferred conversion scheme between characters
  and sound codewords;
. Figure 5 is a diagram showing the use of a service system to translate site codes to
  site URLs; and
. Figure 6 is a diagram showing the use of the Figure 5 service system by a network
  voice browser.



Best Mode of Carrying Out the Invention 

Figure 1 depicts a tone URL translator 1 for receiving a sequence of tones that encode the
characters of a URL. The tones are received as sound through microphone 2 but may also
be received in analogue or digital electrical signal form. A converter 3 converts the
received tone signals into a common internal format before passing the tone signals to a
unit 4 that determines the frequencies of the received tones and generates corresponding
respective tone codewords. These sound codewords are supplied to unit 5 where they are
converted into a URL character string according to a predetermined mapping process.
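
By way of illustration only, the decoding chain of units 4 and 5 might be sketched as
follows. The sample rate, tone duration, tone frequencies, character alphabet and the use
of a Goertzel detector are all assumptions introduced for this example; the description
does not prescribe how unit 4 determines the tone frequencies.

```python
import math

SAMPLE_RATE = 8000        # assumed sampling rate in Hz
SAMPLES_PER_TONE = 400    # assumed tone length (50 ms at 8 kHz)
# Hypothetical character alphabet; codeword i stands for ALPHABET[i].
ALPHABET = "abcdefghijklmnopqrstuvwxyz0123456789.:/-_~"
# Hypothetical tone set: codeword i is carried by TONES_HZ[i] (500 Hz to 2960 Hz).
TONES_HZ = [500 + 60 * i for i in range(len(ALPHABET))]


def goertzel_power(samples, freq_hz):
    """Return the signal power at freq_hz using the Goertzel recurrence."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq_hz / SAMPLE_RATE)
    s_prev, s_prev2 = 0.0, 0.0
    for x in samples:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev ** 2 + s_prev2 ** 2 - coeff * s_prev * s_prev2


def tone_to_codeword(samples):
    """Role of unit 4: pick the candidate tone carrying the most power."""
    powers = [goertzel_power(samples, f) for f in TONES_HZ]
    return max(range(len(TONES_HZ)), key=powers.__getitem__)


def codewords_to_url(codewords):
    """Role of unit 5: map each tone codeword to a character."""
    return "".join(ALPHABET[c] for c in codewords)


def synthesize(text):
    """Helper for experimentation: one block of sine samples per character."""
    blocks = []
    for ch in text:
        f = TONES_HZ[ALPHABET.index(ch)]
        blocks.append([math.sin(2.0 * math.pi * f * n / SAMPLE_RATE)
                       for n in range(SAMPLES_PER_TONE)])
    return blocks


def decode(blocks):
    """Decode a list of per-tone sample blocks into a character string."""
    return codewords_to_url(tone_to_codeword(b) for b in blocks)
```

In this sketch the tone blocks are assumed to be already segmented; a practical
translator would additionally need to locate tone boundaries in the received signal.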

Figure 2 shows a first mapping scheme for converting between tones and character codes.
In this example, there is a one-to-one correspondence between tones and character codes;
that is, each tone maps to one character code. In Figure 2, the left-hand column shows the
set of available tones 6 in increasing order of frequency, the center column corresponds to
the set of tone codewords 7 arranged in increasing codeword value, and the right-hand
column is the set of character codes 8 in standard order (for example, the ASCII character
code set arranged in increasing order of binary value).

Moving from a tone to a character code (or vice versa) involves two mappings, namely a
first mapping 9A between tone and tone codeword, and a second mapping 9B between tone
codeword and character code. The overall mapping between tones and character codes is a
combination of the two mappings 9A and 9B. In the Figure 2 example, both mappings 9A
and 9B are simple one-to-one mappings with the values on each side of the mappings both
increasing / decreasing as the sets 6, 7 and 8 are progressed through.
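
The two-stage mapping of Figure 2 might be sketched as below. The tone frequencies (256
tones spaced 12 Hz apart from 300 Hz) are purely illustrative assumptions, as are the
function names; the only properties taken from the description are that mappings 9A and
9B are one-to-one and monotonic.

```python
# Assumed tone set 6: 256 frequencies in increasing order (300 Hz to 3360 Hz).
TONES_HZ = [300.0 + 12.0 * i for i in range(256)]


def mapping_9b_encode(ch):
    """Mapping 9B, encoding direction: character code -> tone codeword."""
    code = ord(ch)
    if code > 255:
        raise ValueError("character outside the 8-bit character set")
    return code


def mapping_9b_decode(codeword):
    """Mapping 9B, decoding direction: tone codeword -> character."""
    return chr(codeword)


def mapping_9a_encode(codeword):
    """Mapping 9A, encoding direction: tone codeword -> tone frequency."""
    return TONES_HZ[codeword]


def mapping_9a_decode(tone_hz):
    """Mapping 9A, decoding direction: nearest tone in the set -> tone codeword."""
    return min(range(len(TONES_HZ)), key=lambda i: abs(TONES_HZ[i] - tone_hz))


def encode_url(url):
    """Overall mapping: combination of 9B followed by 9A."""
    return [mapping_9a_encode(mapping_9b_encode(ch)) for ch in url]


def decode_tones(tones_hz):
    """Overall reverse mapping: combination of 9A followed by 9B."""
    return "".join(mapping_9b_decode(mapping_9a_decode(t)) for t in tones_hz)
```

Because both stages are monotonic here, a character with a higher code always yields a
higher tone; for instance `encode_url("ab")` gives two tones 12 Hz apart.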

Implementing the Figure 2 scheme using the Figure 1 translator involves the unit 4 carrying
out the mapping 9A and unit 5 carrying out the mapping 9B. It will be appreciated that the
encoding process by which URL characters are converted to tone sequences is the reverse
of the decoding process carried out by translator 1 and can be effected by appropriate
apparatus.


Whilst the foregoing mapping of Figure 2 is extremely simple and therefore easy to
implement, it suffers from the disadvantage that the sequence of tones produced when any
particular URL is encoded is likely to be unpleasant to the human ear.

To alleviate this, a modified mapping is used, one example modified mapping being
illustrated in Figure 3. In this example, the mapping 9A between tones and tone codewords
is modified such that the overall mapping between tones and character codes results in
frequently used character combinations of URLs producing pleasant sound sequences (that
is, sequences of a musical character where "musical" is to be understood broadly, including
chimes and the like). The character combinations so encoded are, for example, the generic
top level domain names and "www".
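
One way such a modified mapping 9A could be constructed is sketched below. The sketch
assumes, purely for illustration, that the tone set consists of 128 equal-tempered notes
indexed like MIDI note numbers and that the characters of "com" should sound as an
ascending C major arpeggio (notes 60, 64 and 67); neither choice comes from the
description. The mapping remains a permutation, so it stays one-to-one.

```python
def midi_to_hz(note):
    """Frequency of an equal-tempered note, with note 69 at 440 Hz (assumed tuning)."""
    return 440.0 * 2.0 ** ((note - 69) / 12.0)


def build_mapping_9a(preferred):
    """Return a list giving, for each tone codeword, the index of its tone.

    Starts from the Figure 2 style identity mapping and swaps entries so that each
    codeword in `preferred` receives its requested tone. Keys and requested tones
    must each be distinct for every request to be honoured.
    """
    table = list(range(128))
    for codeword, tone in preferred.items():
        other = table.index(tone)  # codeword currently assigned that tone
        table[codeword], table[other] = table[other], table[codeword]
    return table


# Hypothetical preference: "c", "o", "m" sound as C4, E4, G4.
PREFERRED = {ord("c"): 60, ord("o"): 64, ord("m"): 67}
MAPPING_9A = build_mapping_9a(PREFERRED)


def encode(text):
    """Character -> codeword (ASCII value) -> tone index via the modified 9A."""
    return [MAPPING_9A[ord(ch)] for ch in text]


def decode(tones):
    """Tone index -> codeword via the inverse of 9A -> character."""
    inverse = {tone: codeword for codeword, tone in enumerate(MAPPING_9A)}
    return "".join(chr(inverse[t]) for t in tones)
```

The characters displaced from notes 60, 64 and 67 take over the tones previously held by
"c", "o" and "m", which is acceptable here because those displaced characters are rare in
URLs.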

The mapping 9B could alternatively or additionally have been modified to produce the 
desired musical sequences. 


It is also possible to choose a mapping that gives a musical sequence for a complete URL. 

In the foregoing encoding/decoding schemes, there is a one-to-one correspondence
between tones and character codes and, as a consequence, it is possible to omit one of the
mappings 9A / 9B and have tones mapping directly to character codes. However, using
intermediate tone codewords gives a degree of flexibility permitting improved encoding.



More particularly, if the character set has 256 characters, then producing 256 tones within
the frequency band of a telephone voice circuit (over which it may be desired to pass sound
URLs) means that the resultant tones are very close together. It is preferable to have a
smaller number of tones - for example 64 tones. However, to efficiently code characters in
this case requires that each group of three characters is encoded by four tones. How this
can be conveniently done is illustrated in Figure 4 where each of three characters is
represented by an 8-bit code. These codes are concatenated to form an intermediate 24-bit
word 50. Word 50 is then split into four 6-bit tone codewords; the 6 bits permit 64 possible
tone codewords which therefore provide an efficient representation of the 64 tones.
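
The Figure 4 conversion can be sketched as follows; the function names are hypothetical,
and the bit ordering (first character in the most significant bits of word 50) is an
assumption, since the description does not state it.

```python
def chars_to_codewords(group):
    """Pack three 8-bit characters into a 24-bit word, then split into four 6-bit codewords."""
    if len(group) != 3:
        raise ValueError("expected a group of exactly three characters")
    word = 0
    for ch in group:
        code = ord(ch)
        if code > 0xFF:
            raise ValueError("character does not fit in 8 bits")
        word = (word << 8) | code            # build the intermediate 24-bit word
    return [(word >> shift) & 0x3F for shift in (18, 12, 6, 0)]


def codewords_to_chars(codewords):
    """Reverse conversion: four 6-bit codewords back to three 8-bit characters."""
    if len(codewords) != 4:
        raise ValueError("expected exactly four codewords")
    word = 0
    for cw in codewords:
        word = (word << 6) | (cw & 0x3F)     # rebuild the 24-bit word
    return "".join(chr((word >> shift) & 0xFF) for shift in (16, 8, 0))
```

For example, under these assumptions the group "com" (codes 0x63, 0x6F, 0x6D) yields the
codewords 24, 54, 61 and 45, each of which selects one of the 64 tones.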


Figure 4 represents a four-to-three mapping between tone codewords and character codes
(mapping 9B), the mapping 9A between tones and tone codewords remaining one-to-one
in this example (though this can be varied). With this encoding scheme, it is more
complicated to determine the details of the mapping (for example, mapping 9A) required to
generate pleasant tone sequences for particular character groups since the characters must
be considered in groups of three. However, since the main target character groups (generic
top level domain names) are three-character groups and since leading spaces can be used to
ensure that each such group is taken as a whole during the encoding process, determining a
mapping for producing pleasant sounds for a small set of character combinations is a
manageable task.
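
The use of leading spaces can be illustrated with the following sketch. It assumes that
the URL ends with a three-character top level domain name and pads only at the front, so
that the final group is exactly that name; the function names are hypothetical and other
padding strategies are equally possible.

```python
def align_to_groups(url, group_size=3):
    """Pad with leading spaces to a multiple of group_size and split into groups."""
    pad = (-len(url)) % group_size
    padded = " " * pad + url
    return [padded[i:i + group_size] for i in range(0, len(padded), group_size)]


def join_groups(groups):
    """Reverse operation on decoding: join the groups and drop the leading padding."""
    return "".join(groups).lstrip(" ")
```

With "example.com" (eleven characters) one leading space is added, giving the groups
" ex", "amp", "le." and "com", so that "com" is encoded as a whole.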

Figure 5 shows an arrangement which also enables pleasant tone sequences to be used to
pass URLs; as will be seen, this arrangement preferably, but not necessarily, makes use of
tone-character mappings such as depicted in Figure 3 which associate pleasant tone
sequences with common character sequences.

More particularly, end-user equipment 10 has a web browser 11 which can be used to
contact web sites over the internet 20. Equipment 10 is provided with a sound input
microphone 13 for receiving sound sequences 12 which represent, or can be used to obtain,
website URLs. The sound sequences are constituted by tone sequences representing
characters according to mappings such as illustrated in Figures 2, 3 and 4. The sound
sequence signals from microphone 13 are passed to translator 14, which is similar in form
to translator 1 of Figure 1, and the resultant character sequences are fed to a discriminator
unit 15. The role of this unit 15 is to determine whether a received character sequence
represents a general URL (in which case it is passed to browser 11 for use in accessing the
corresponding website), or whether it represents a site code intended to be translated into a
URL; in the present example, service system 25 with URL "mapmusic.com" provides such
a translation service.



The sound sequence 12 depicted in Figure 5 corresponds to the input of a site code. The
sound sequence is made up of four segments, namely a "start" segment 12A which can be a
special character sequence indicating the start of a sequence, a sound segment 12B that
encodes characters indicating that a site code is being provided, a sound segment 12C
encoding the site code itself, and a stop segment indicating the end of the sequence 12. The
start and stop codes would typically also be used to delimit a tone sequence directly
encoding a URL.


When the discriminator sees the characters indicative of a site code, it knows that the next
set of characters constitutes the site code and that this code requires translation into a URL.
The indicator characters can, in fact, be the URL of the translation service system - in this
example "mapmusic.com".
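
The behaviour of the discriminator on a decoded character sequence might be sketched as
follows. The start and stop characters used here are hypothetical stand-ins for the start
and stop segments, whose actual form the description leaves open; the indicator string
"mapmusic.com" is taken from the present example.

```python
START = "<"                  # hypothetical decoded form of the start segment
STOP = ">"                   # hypothetical decoded form of the stop segment
INDICATOR = "mapmusic.com"   # characters indicating that a site code follows


def discriminate(sequence):
    """Classify a decoded sequence as ('site_code', code) or ('url', url)."""
    framed = (len(sequence) >= len(START) + len(STOP)
              and sequence.startswith(START)
              and sequence.endswith(STOP))
    if not framed:
        raise ValueError("sequence is not delimited by start and stop codes")
    body = sequence[len(START):len(sequence) - len(STOP)]
    if body.startswith(INDICATOR):
        # The characters after the indicator constitute the site code.
        return ("site_code", body[len(INDICATOR):])
    return ("url", body)
```

A site code result would be handed on for translation, whereas a URL result would go
directly to the browser. Note that in this simple sketch a directly encoded URL that
itself begins with the indicator characters would be treated as a site code.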


The discriminator 15 next passes the site code to unit 16 which proceeds to contact service
system 25 over the internet 20 (see arrow 22), passing it the site code 18. A map-site-code
block 26 at service system 25 does a simple database lookup in database 28 to convert the
site code into the corresponding site URL which it then returns to the unit 16 (see arrow
23). Unit 16 then passes the URL to browser 11 which uses it to contact the website
concerned - in this case, website 40.



The Figure 5 arrangement permits the use of site codes chosen because they sound pleasant
when encoded into sound, the corresponding code characters being of little relevance
provided they are unique. Furthermore, if the mapping used in the encoding scheme has
been selected such that both the start and stop segments, as well as the "mapmusic.com"
URL, all have pleasant sounds, then the sound sequence 12 will be acceptable to the human
ear regardless of the site being pointed to.

Figure 6 shows a variation of the Figure 5 arrangement in which the functionality of
equipment 10 is incorporated into a voice browser 33 located in the communications
infrastructure (for example, provided by a PSTN or PLMN operator or by an ISP). A voice
browser allows people to access the Web using speech and is interposed between a user 32
and a voice page server 60. This server 60 holds voice service pages (text pages) that are
marked-up with tags of a voice-related markup language (or languages). When a page is
requested by the user 32, it is interpreted at a top level (dialog level) by a dialog manager
37 of the voice browser 33 and output intended for the user is passed in text form to a
Text-To-Speech (TTS) converter 36 which provides appropriate voice output to the user.
User voice input is converted to text by speech recognition module 35 of the voice browser
33 and the dialog manager 37 determines what action is to be taken according to the
received input and the directions in the original page. Whatever its precise form, the voice
browser can be located at any point between the user and the voice page server; in the
present case, it is shown as located in the communications infrastructure.

The sound channel between the user's equipment 31 (for example, a mobile phone) and the
voice browser 33 permits a tone-encoded character sequence to be passed to the browser. This
tone sequence is intercepted by unit 38 and passed to functionality corresponding to units
14, 15 and 16 in Figure 5. If the tone sequence includes a general URL this is passed to the
browser for action, whereas if the tone sequence includes a site code, the service system is
accessed to determine the corresponding URL, the latter being returned and passed to the
browser.

In both the arrangements of Figures 5 and 6, the unit 16 preferably includes a cache which
is used to store the site codes and their corresponding URLs received back from the service
system 25. In this case, before the unit 16 accesses the service system to get a translation of a
newly-received site code, it first checks its cache to see if it already has the required URL
in cache - if it does, the URL is passed to the browser without the service system being
accessed.
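
The cache behaviour of unit 16 can be sketched as below. The class name and the `lookup`
callable, which stands in for a request to service system 25, are hypothetical; how the
service system is actually contacted is outside the scope of the sketch.

```python
class SiteCodeResolver:
    """Translates site codes to URLs, consulting a local cache before the service system."""

    def __init__(self, lookup):
        self._lookup = lookup   # hypothetical callable: site code -> URL (service system 25)
        self._cache = {}        # site code -> URL pairs already received back

    def resolve(self, site_code):
        """Return the URL for site_code, contacting the service system only on a cache miss."""
        if site_code in self._cache:
            return self._cache[site_code]
        url = self._lookup(site_code)
        self._cache[site_code] = url
        return url
```

A resolver built this way contacts the service system once per distinct site code; repeat
requests for the same code are answered from the cache. No expiry policy is shown, so a
change of URL at the service system would not be noticed by this simple version.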

Many variants are, of course, possible to the arrangements described above. For example,
whilst the sound features used to represent the codewords 7 have been tones in the
foregoing examples, the codewords could be used to produce a different type of sound
feature, such as:
- tone combinations;
- occurrence of maximum sound output power in predetermined frequency bands;
- changes in output frequency;
- different modulation frequencies of one or more tones.
Furthermore, the sound features can occur not only sequentially as described, but also in
overlapping relation provided that it remains possible to determine character sequencing on
decoding of the sound URL.

CLAIMS

1. A method of encoding a URL in sound, wherein the characters of the URL are mapped
to sound features in a sound output, the nature of the sound features and of the mapping
between characters and sound features being such that at least certain character
combinations that occur frequently in URLs produce sound sequences of a musical
character.

2. A method according to claim 1, wherein the characters of the URL are mapped to
produce sound codewords each of which is used to produce, in a sound output, a sound
feature particular to that codeword.

3. A method according to claim 1 or claim 2, wherein the sound features comprise one of:
- fixed-frequency tones or tone combinations;
- occurrence of maximum sound output power in predetermined frequency bands;
- changes in output frequency;
- different modulation frequencies of one or more tones.

4. A method according to claim 2, wherein characters of the URL are taken in groups of a
first number of characters to form a second number of sound codewords, said second
number being different from said first number.

5. A method according to claim 4, wherein three characters each represented by eight bits
are used to form four six-bit sound codewords.

6. A method according to any one of the preceding claims, wherein the generic top-level
domain names encode to sound sequences of a musical character.

7. A method according to any one of the preceding claims, wherein at least one URL
encodes in its entirety to a sound sequence of a musical character.

8. A method of decoding a sound sequence into a URL, wherein sound features of the
sound sequence are mapped to characters of the URL, the nature of the sound features and
of the mapping between sound features and characters being such that sound sequences of
a musical character represent at least certain character combinations that occur frequently
in URLs.




Sound Links 


A method is provided of encoding a URL in sound. The characters of the URL (8) are
mapped to sound codewords (7) each of which is used to produce, in a sound output, a
sound feature (6) particular to that codeword, the nature of the sound features and of the
overall mapping between characters and sound features being such that at least certain
character combinations that occur frequently in URLs produce sound sequences of a
musical character.
(Fig. 3)